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•smpiriwU  evaluation  of  PCR*  indicates  that  it  indeed  tends  to  choose  the  ap* 
propnatc  degree  of  regulariaation  for  a  collccUon  of  data  sets. 

The  current  focus  of  the  project’s  latests  rescardi  is  in  the  area  of  classifica¬ 
tion.  Here,  each  learned  model  attempts  to  assign  a  class  label  to  each  example. 
Combining  methods  for  classification  typically  assign  weights  to  each  learned 
model  and  take  the  “weighted  majority”,  i.e., 

fix)  =  =  c|i 

where  y  is  the  set  of  possible  classes,  and  ||a  e  6l|  is  one  of  o  is  equal  to  b,  and 
zero  otherwise. 

Initial  research  in  this  area  [Merz,  1995]  has  indicated  that  the  naive  ap- 
proadi  of  taking  the  most  frequent  class  (i.c.,  using  uniform  weights)  is  quite 
effective.  However,  when  the  learned  models  tend  to  make  uncorrelated  errors, 
more  elaborate  w^eighting  methods  may  do  better.  In  an  attempt  to  explore 
weiglit  sets  analogous  to  those  found  in  the  regression  task,  two  methods  are 
currently  under  development. 

The  first  is  a  direct  extension  of  PCR*  where  each  class  is  considered  a 
separate  (0/1)  regression  problem.  The  *  —  th  model's  prediction  for  the  j  —  th 
class 'would  be  /ijf(x).  A  zero  for  fij{x)  would  indicate  that  example  x  is  not 
predicted  to  be  a  member  of  class  jf,  and  a  one  would  idicate  the  opposite.  One 
weight  set  would  be  derived  for  cacli  of  the  J  classes  as  follow-s: 

<=1 

III  deriving  the  combined  estimate,  fix),  a  slightly  more  elaborate  weighted 
majority  sdicme  than  given  above  is  used.  In  this  case,  each  learned  model 
has  a  particular  weight  for  each  possible  class.  A  preliminary  evaluation  of 
this  extension  of  PCR*  indicates  that  it  also  does  a  good  job  of  choosing  the 
appropriate  weight  set  for  a  given  data  set. 

The  second  approa<h  being  explored  is  ba.sed  on  a  statistical  procedure 
known  as  “correspondence  analysis”.  Here  the  examples  ondthc  learned  mod¬ 
els  are  .ccalod  into  the  same  geometric  space.  In  this  representation,  relation¬ 
ships  brtween  examples  and  learned  models  may  be  exploited  so  that  a  learned 
model  s  weiglit  may  be  increased  for  examples  on  which  it  is  likely  to  be  accu¬ 
rate. 
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